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SYSTEM AND METHOD FOR A FAILOVER PROTOCOL IN STORAGE AREA 

NETWORK CONTROLLERS 

TECHNICAL FIELD 

The present disclosure relates generally to the field of storage area networks and, 
more particularly, to an enhanced system and method for multipath failover for storage systems with 
multiport controllers. 

BACKGROUND 

A storage area network (SAN) maybe used to provide centralized data sharing, data 
backup, and storage management. A storage area network is a high-speed network of shared storage 
systems. Elements of a SAN include servers, switches and storage systems. A storage system is a 
device that contains a single disk or multiple disks for storing data for a computer system or 
computer network. A storage system may also contain single or multiple storage controllers for 
controlling the disks in the system. Each storage controller is itself a hardware device and may have 
multiple ports. The storage units of a storage system are accessed through the ports and storage 
controllers of the storage system. Storage units are often called LUNs and are typically the storage 
entity mounted and accessed by servers of the SAN. "LUN" is a SCSI acronym for logical unit 
number. The logical unit number addresses one of many storage units that may be present in a SCSI 
target. This disclosure will use the term storage unit instead of LUN. A collection of storage units 
is sometimes referred to as a storage pool. 

SANs are typically deployed in operating environments that require high availability, 
necessitating in the design of the networks redundancy and other mechanisms to provide continuous 
availability in the event of component failure. The storage systems in a SAN can be collocated, 
which allows for easier maintenance and easier expandability of the storage pool. In a typical 
network architecture of a SAN, all of the storage units in the storage pool are potentially available 
to all the servers of the SAN. Additional storage units can be easily added to the storage pool by 
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adding additional disks to the present set of storage systems or by adding additional storage systems 
together with additional disks. These new storage units will also be accessible from any server of 
the SAN. A server of a SAN can act as a pathway or transfer agent between the end user and the 
stored data on the storage units. Each server of a SAN is usually connected to the SAN by two or 
more host bus adapters (HBAs). Each server also typically includes a communications or multipath 
driver, which governs the communication between the server and the storage units of the SAN. 
Network servers may be coupled to the SAN according to the Fibre Channel protocol. 

A storage system on a SAN is typically identified by its Node Worldwide Name 
(WWN). A storage controller on a SAN is typically identified by one or more Port WWNs present 
on a storage systems with a specific Node WWN. A storage unit on a SAN is typically identified 
by a unique SCSI logical unit number (LUN) within the storage systems Node WWN. Each storage 
unit of the SAN can be assigned to one or more predetermined servers of the SAN according the 
LUN address of the storage unit. 

A single storage system in a SAN may be configured to include multiple storage units, 
which are managed by multiple storage controllers. Storage units on a storage system are accessed 
through a storage path. The storage path through each storage system is directed by the multipath 
driver that is installed on each server of the SAN. As an example of a storage path, a multipath 
driver may identify that a particular storage unit in the SAN is to be accessed through a 
predetermined port and predetermined storage controller in the storage system. Because a storage 
unit may be accessed through multiple storage paths, a multipathing driver is required in the server 
to select from the available storage paths. 

The storage paths between a server of the SAN and a storage unit may include a 
default path and a number of alternate paths. In the event that the storage controller or the port of 
the storage controller of the default path is rendered inoperable, the multipath driver identifies 
alternate paths for accessing the storage unit. The selection of an alternate path occurs automatically 
and is conducted according to the failover protocol of the multipath driver. Often, however, the 
failover protocol of the multipath driver selects a secondary or alternate path that is also inoperable. 
For example, a failure of the storage controller will cause all alternate paths through the storage 
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controller to become inoperable. Because multiple ports may be coupled to the failed storage 
controller multiple alternate paths will be affected by the failure of the single storage controller. 
Known failover protocols involve a selection of a storage path without regard to the logical 
relationship between the ports and the controllers of the SAN. 
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SUMMARY 

In accordance with the present disclosure, a method and system is provided for 
managing the failover protocol in a storage area network. The failover protocol of the present 
disclosure accounts for the relationship between the ports and storage controller of storage systems 
that include multiple ports and multiple controllers. The failover protocol of the present invention 
includes an analysis of the available storage paths from a particular server on which the mulitpath 
driver resides and the set of storage units on the SAN that are configured to be accessible by the 
server. This information may be included in an inquiry page or in a database that is presented by the 
storage system and is accessible by the multipath driver on each server in the SAN. The multipath 
driver accesses the inquiry page or database to determine the association between ports and storage 
controllers of each storage system. When a selected storage path fails or becomes inoperable, the 
multipath driver of the affected server selects an alternate storage path that does not include the 
storage controller of the failed storage path. 

A technical advantage of the present disclosure is a failover protocol that improves 
the performance of the SAN during failover conditions. Instead of selecting alternate storage paths 
that include the storage controller of the failed storage path, the failover protocol described herein 
selects a path on a storage controller that is not the controller attached to the failed path. In this 
manner, the failover protocol does not result in the expenditure of additional time by selecting an 
alternate storage path that has a high probability of including a failed storage controller. Other 
technical advantages will be apparent to those of ordinary skill in the art in view of the following 
specification, claims, and drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

A more complete understanding of the present embodiments and advantages thereof 
may be acquired by referring to the following description taken in conjunction with the 
accompanying drawings, in which like reference numbers indicate like features, and wherein: 
Figure 1 is a block diagram of an embodiment of a storage area network; and 
Figure 2 is a flow diagram of a method for selecting a storage path through a storage 

area network. 
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DETAILED DESCRIPTION 

The present disclosure concerns a method and system for a storage area network 
(SAN) multipath failover protocol. The protocol described herein involves a determination and 
analysis of the logical association between ports, storage controllers, and storage units (often referred 
to as LUNs) of each storage system of the SAN. If a storage path of the SAN is determined to be 
inoperable, an alternate storage path is selected according to the disclosed failover protocol. Because 
the disclosed failover protocol involves a determination and analysis of the association of ports and 
storage controllers of each storage system, the disclosed storage protocol provides for the selection 
of an alternate storage path that has a greater likelihood of being operable in the event of a failure 
of a storage path. 

Shown in Figure 1 is a diagram of a storage area network, which is indicated generally 
at 10. SAN 10 includes a server 12 and a storage system 14. Although a SAN may include multiple 
servers and multiple storage systems, the example of Figure 1 includes a single server coupled to a 
single storage system. Server 12 and storage system 14 are coupled to one another across a 
switching network 16. Switching network 16 is coupled to server 12 through the host bus adapters 
25 (HBAs) of server 12. Storage system 14 includes a number of ports 18, which are further 
identified as ports P0-P7. Each of ports 18 is coupled to one of two controllers 20, which are further 
identified in Figure 1 as Controller 1 and Controller 2. In the example of Figure 1, ports P0-P3 are 
coupled to Controller 1, and ports P4-P7 are coupled to Controller 2. Storage system 14 includes a 
number of storage units 22. Each storage controller 20 is able to access each storage unit 22. 

Server 12 includes a multipath driver 26. Multipath driver identifies the default path 
and the alternate paths between server 12 and each of the storage units 22. Multipath driver 26 
accesses an inquiry page 27. Each storage system maintains an inquiry page 27, which is a unique 
data page of information concerning the configuration of the storage system! By accessing the data 
of inquiry page 27, the multipath driver can determine alternate paths between the server or initiator 
12 and the storage units 22. The inquiry page includes data that identifies the unique descriptor or 
identifier for each storage unit in the SAN. The inquiry page also includes data that identifies the 
storage system that presents or includes the storage unit in the SAN. The inquiry page also identifies 

HOU03:825991.3 



ATTORNEY' S DOCKET PATENT APPLICATION 

016295.0692 (DC-03100) 

7 

the port number through which the storage unit is currently being accessed by the server or initiator. 
The inquiry page also identifies the physical controller on which the port through which the storage 
unit is accessed resides. Multipath driver 26 issues inquiries to inquiry page 27 to access the data 
of the inquiry page. Multipath driver 26 may construct a data table 29 of data retrieved from inquiry 
page 27. 

As an example of the association of storage units, controllers, and ports in a storage 
system, the default path between server 12 and LUN 0010 of Figure 1 might be port PI through 
Controller 1 to storage unit LUN 001 1 . The association of the storage unit 0010 to port PI , and the 
association of Controller 1 to port PI is recorded in the inquiry page. As a second example, the 
default path between server 12 and LUN 0110 might be port P7 through Controller 2 to LUN 01 10. 
Similarly, the association of LUN 01 10 to port P7, and the association of Controller 2 to port P7 
would be recorded in the inquiry page. As part of the protocol for identifying alternate paths for 
accessing each storage unit, multipath driver 26 accesses inquiry page 27 to build a record of the 
available paths for accessing each storage unit 22. As an example, the record of available paths for 
accessing LUN 0010 is: 

P0 : Controller 1 : LUN 0010 
PI : Controller 1 : LUN 0010 
P2 : Controller 1 : LUN 0010 
P3 : Controller 1 : LUN 0010 
P4 : Controller 2 : LUN 0010 
P5 : Controller 2 : LUN 0010 
P6 : Controller 2 : LUN 0010 
P7 : Controller 2 : LUN 0010 

In the example of Figure 1, multipath driver 26 is able to identify that ports P0-P3 are associated with 
Controller 1 and ports P4-P7 are associated with Controller 2. 

The correlation of ports and controller permits the multipath driver to identify a 
suitable failover path in the even that the default of selected storage path is inoperable or times out. 
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The correlation of ports and controllers in the inquiry page permits the multipath driver to access the 
inquiry page to assign a failover path that includes a controller that is different than the controller 
of the currently selected storage path. As an example, assume that server 12 is accessing LUN 0010 
through port P 0 and Controller 1 . If this path becomes inoperable or times out, the multipath driver 
can assign a failover path that includes a port that is not associated with the controller of the 
inoperable path. In this example, a storage path encountered by the multipath driver that is not 
associated with Controller 1 is through port P4 and Controller 2. 

The selection of an alternate storage path through Controller 2 averts the possibility 
that the selection of a second storage path that accesses the selected storage unit through Controller 
1 will also be inoperable. If the failure of Controller 1 is the cause of the inoperable default storage 
path, all other storage paths through Controller 1 will be disabled. Selecting a storage path through 
Controller 2 provides a greater likelihood of locating a functioning storage path in a much shorter 
time frame. 

Shown in Figure 2 is a flow diagram of the method followed by the multipath driver 
to determine the association of ports and controllers and select an alternate storage path in the event 
of a failure in the default or selected storage path. At step 30, upon initialization of the SAN, each 
multipath driver of the SAN identifies the available storage paths for accessing each storage unit in 
the SAN. Each storage path includes an identification of the port, storage controller and storage unit 
of the path. Following the determination of the available storage paths for each storage unit, the 
multipath drivers at step 32 access the inquiry page to determine the associations between storage 
ports and storage controllers. At step 33, the multipath drivers identify a primary path and at least 
one alternate path. The alternate path will differ from the primary path in that the alternate path will 
include a controller that is different than the controller of the primary path. 

At step 34, the server attempts to access a storage unit through the selected storage 
path, which in this instance is the primary storage path. If the primary storage path has been 
determined to inoperable at step 36, the multipath driver at step 37 selects an alternate path for data 
accesses between the server and the selected storage unit. The selection of the alternate path 
involves a selection of a storage path that includes a storage controller that is different from the 
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storage controller of the existing storage path. As such, regardless of the number of ports on the 
storage controller of the existing storage path, and regardless of the number of alternate storage paths 
through the storage controller of the existing storage path, an alternate storage path is selected that 
does not include the storage controller of the existing storage path. Once an alternate storage path 
is selected, processing continues at step 34 with an attempt by the server to access the storage unit 
through the alternate storage path. As an alternative, the alternate path that involves a separate 
controller may be a second alternate path, with the first alternate path being a path that includes the 
same controller as the primary storage path. 

Although the present disclosure has been described in detail, it should be understood 
that various changes, substitutions, and alterations can be made hereto without departing from the 
spirit and the scope of the invention as defined by the appended claims. 
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